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Abstract. This study aims to identify and analyze the ways in which semiotic 
resources are orchestrated by teacher-trainees during videoconferenced French 
foreign language teaching. Our corpus is made up of six weeks of interaction 
between seven teacher trainees enrolled in a master’s program at Universite Lumiere 
Lyon 2 in France and 12 undergraduate business students at Dublin City University 
in Ireland. 15.5 hours of video data were transcribed using ELAN, and 195 lexical 
explanation sequences were identified. These sequences were categorized according 
to whether the lexical problem came from an inability to understand or an inability 
to speak, and were then split into phases according to a three-step canonical model 
(Lauzon, 2008). During each phase we look at which semiotic resources were used, 
combined and co-contextualized by the teacher trainees. Our methodology, which 
will be used to complete our PhD project, is outlined here as well as preliminary 
results. 
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1. Introduction 

It has been shown that during native-non-native and non-native-non-native 
interaction, negotiation sequences are most commonly initiated by lexical triggers 
(Nicolaev, 2010; Smith, 2003). Various models for negotiation sequences have 
been proposed (Smith, 2003; Varonis & Gass, 1985) to describe how a negotiation 
sequence unfolds in time. Lauzon (2008), for example, presents a three-step 
canonical model specifically for lexical explanation sequences during teacher- 
student interactions. A lexical explanation sequence occurs when the normal 
progression of a conversation or activity comes to a halt and participants orient 
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their focus towards a lexical item that a learner fails to understand or fails to 
produce. In the first case, the non-understood lexical item comes from the teacher’s 
discourse or from a pedagogical support such as a video, audio track, text or image. 
In the second case, failure to produce a lexical item hinders the learner’s foreign 
language output. The teacher must intervene to explain the non-understood lexical 
item or provide a sufficient one in order for communication to resume. In Lauzon’s 
(2008) three-phase canonical model, there is an opening, a nucleus, and a closing. 
During the opening phase, the lexical item that is either misunderstood or searched 
for by the learner is recognized by the participants and problematized. During 
the nucleus, the teacher attempts to explain the non-understood word or provide 
the correct word that is being searched for by orchestrating various linguistic and 
multimodal resources. During the closing phase, the learner’s comprehension or 
ability to properly use the just-provided word is evaluated or ratified by the teacher. 

Of interest to us are the ways in which the different phases are multimodally 
orchestrated by the participants. A multimodal phase (Baldry & Thibault, 2006) 
is defined by semiotic coherence. During each phase, different modes (voice, eye 
contact, posture, proxemics, head movements, facial expressions, gestures, written 
text, etc.) work together in synergy to make meaning, with the hierarchy of modes 
constantly being rearranged and renegotiated by the participants (Norris, 2004). 
According to Norris (2004), a mode has high modal density if it carries most of 
the meaning at a given time or if communication would come to a halt if it were 
removed. In a videoconferencing environment such as Visu (Guichon, Betrancourt, 
& Prie, 2012), these modes are channeled through the affordances of the platform. 
In our case, Visu offers a video channel, an audio channel, a text chat channel, 
and the ability to send prefabricated instructions and multimedia documents 
such as images and links to external websites. The aim of this study is to identify 
what modes are used for what purposes during each phase of lexical explanation 
sequences and to explore the ways in which the modes interact with each other. 


2. Method 

2.1. Corpus 

Our corpus, which is part of the ISMAEL project (Guichon, Blin, Wigham, & 
Thouesny, 2014), consists of six weeks of videoconferenced interactions mediated 
by Visu during the fall semester of 2013 between seven future teachers of French 
enrolled in a master’s program for teaching French as a foreign language at 
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Universite Lumiere Lyon 2 in Lyon, France, and 12 undergraduate learners of 
French at Dublin City University in Dublin, Ireland. These undergraduate business 
students had a level B1-B2 in French and were preparing to complete an internship 
the following year in Reims, France. The 40-minute interactions were specifically 
engineered for business purposes and were centered around themes such as paid 
vacation, strikes, coffee breaks, professional experience, project management, job 
interviews, etc. The 28 interactions that were collected have a total time of 15.5 
hours and were transcribed using ELAN (Wittenburg et al., 2006). 

The verbal and text chat channels were transcribed for all participants for the entire 
corpus. Next, we identified 195 lexical explanation sequences and split each one 
into phases using the model mentioned above. The lexical explanation sequences 
were then divided almost evenly into two main categories: sequences resulting 
from a comprehension problem and those resulting from a production problem. 


3. Preliminary results and discussion 

Each multimodal environment has a unique palette of semiotic resources, which 
makes direct comparisons difficult. For this particular multimodal environment, our 
phase-by-phase preliminary analysis shows that a diverse range of semiotic resources 
was used during the opening, nucleus and closing phases for different purposes. 

Gestures, which play a critical role during classroom foreign language teaching 
(Tellier, 2008), were put to use during some lexical explanation sequences. In 
particular, during word searches, teachers sometimes used gesture to verify their 
own comprehension of the word being searched before providing it. For example, 
when asked by a learner how to say ‘yawning’ in French, one teacher trainee, 
unsure of her own comprehension of the English word, used a gesture to verify it 
before providing the French translation (see Figure 1, left). For sequences triggered 
by incomprehension, gestures were used to describe actions, concepts, concrete 
objects, and even explanation strategies. For example, when explaining the verb 
‘to strike’, one teacher- trainee illustrated the act of chanting slogans by moving his 
arm up and down in front of the webcam (see Figure 1, center). On many occasions, 
gestures carried high modal density because they were produced in the absence of 
speech (such as in the ‘yawning’ example) and/or deliberately placed directly in 
front of the webcam. The third teacher-trainee pictured below (see Figure 1, right) 
rarely made gestures visible to the webcam, but in this case made an effort to make 
her emblematic gesture visible when illustrating the concept of adding a zero to a 
monthly paycheck. 
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Figure 1. Gestures produced by teacher- trainees during lexical explanation 
sequences 



Strike 


Earn a little extra money 


"Because actually in France, uh 1 don't know how it is 
in Ireland but in France you receive your salary and 
your paycheck at the end of the month, so when you 
“round the end of the month" you're supposed to 
[add a zerol okay so you have a bigger paycheck.'' 


Yawn in 


"It's when you go [...1‘ 


"It's when people stop working and they go out 
[ in the [street to chant slogans] and want better 
working conditions.” 


The text chat was used during all three phases. Before incomprehension was 
manifested, teacher-trainees often anticipated the difficulty of a word by typing 
it in the text chat window. During the nuclei and closing phases, teacher-trainees 
used the text chat to type the word that was either just explained or provided in 
order to offer a visual representation and reinforce memorization. Text messages 
were not always redundant, often obtaining high modal density when produced 
in the absence of oral speech. Multimedia documents were also sent during all 
three phases of lexical explanation sequences having to do with incomprehension, 
either to present learners with a visual representation before, during or after the 
explanation. 

Finally, we have found synchronization and co-contextualization of resources. The 
text chat was often co-contextualized by other resources: teacher- trainees regularly 
said, “I’m putting it in the chat window” or made gestures pointing to the chat 
window. The sending of documents was similarly co-contextualized by the verbal 
channel. Teacher-trainees tended not to make hand gestures and send written 
messages simultaneously due to the difficulty of typing and making gestures at the 
same time. 


4. Conclusions 

More data must be analyzed in order to proceed to any sort of statistical analysis. 
Furthermore, facial expressions, eye movements, eyebrow movements and posture, 
which are important meaning-making resources in face-to-face interaction, still 
need to be analyzed. For our PhD project, by using the methodology outlined 
above, we will analyze the entirety of our 195-sequence corpus in order to spot 
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patterns, explore variation between the participants, and eventually propose a 
structural model for videoconferenced lexical explanation sequences similar to the 
one proposed by Smith (2003) for synchronous text-based negotiation sequences. 
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